MU-GAN: Facial Attribute Editing Based on Multi-Attention Mechanism

نویسندگان

چکیده

Facial attribute editing has mainly two objectives: 1) translating image from a source domain to target one, and 2) only changing the facial regions related preserving attribute-excluding details. In this work, we propose multi-attention U-Net-based generative adversarial network (MU-GAN). First, replace classic convolutional encoder-decoder with symmetric U-Net-like structure in generator, then apply an additive attention mechanism build attention-based U-Net connections for adaptively transferring encoder representations complement decoder detail enhance ability. Second, self-attention (SA) is incorporated into layers modeling long-range multi-level dependencies across regions. Experimental results indicate that our method capable of balancing ability details preservation ability, can decouple correlation among attributes. It outperforms state-of-the-art methods terms manipulation accuracy quality. Our code available at https://github.com/SuSir1996/MU-GAN.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Arbitrary Facial Attribute Editing: Only Change What You Want

Facial attribute editing aims to modify either single or multiple attributes on a face image. Since it is practically infeasible to collect images with arbitrarily specified attributes for each person, the generative adversarial net (GAN) and the encoder-decoder architecture are usually incorporated to handle this task. With the encoder-decoder architecture, arbitrary attribute editing can then...

متن کامل

Video Editing Based on Behaviors-for-attention

In desktop manipulations such as cooking or assembly instruction videos, a performer often draws the viewers’ attention to significant portions of the video’s content by means of a specific set of physical behaviors. We verified the efficacy of videos edited based on those behaviors by observing television programs and simulating them with the use of our editing scheme. In this paper, we first ...

متن کامل

DNA-GAN: Learning Disentangled Representations from Multi-Attribute Images

Disentangling factors of variation has always been a challenging problem in representation learning. Existing algorithms suffer from many limitations, such as unpredictable disentangling factors, bad quality of generated images from encodings, lack of identity information, etc. In this paper, we propose a supervised algorithm called DNA-GAN trying to disentangle different attributes of images. ...

متن کامل

Attention Based Facial Symmetry Detection

Symmetry is a fundamental structure that is found to some extent in all images. It is thought to be an important factor in the human visual system for obtaining understanding and extracting semantics from visual material. This paper describes a method of detecting axes of reflective symmetry in faces that does not require prior assumptions about the image being analysed. The approach is derived...

متن کامل

Facial Expression Recognition Based on Structural Changes in Facial Skin

Facial expressions are the most powerful and direct means of presenting human emotions and feelings and offer a window into a persons’ state of mind. In recent years, the study of facial expression and recognition has gained prominence; as industry and services are keen on expanding on the potential advantages of facial recognition technology. As machine vision and artificial intelligence advan...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE/CAA Journal of Automatica Sinica

سال: 2021

ISSN: ['2329-9274', '2329-9266']

DOI: https://doi.org/10.1109/jas.2020.1003390